Classification of unlabeled online media
نویسندگان
چکیده
Abstract This work investigates the ability to classify misinformation in online social media networks a manner that avoids need for ground truth labels. Rather than approach classification problem as task humans or machine learning algorithms, this leverages user–user and user–media (i.e.,media likes) interactions infer type of information (fake vs. authentic) being spread, without needing know actual details itself. To study inception evolution over time, we create an experimental platform mimics functionality real-world networks. We develop graphical model considers network topology uncertainty (entropy) propagation when fake authentic disseminates across network. The creation enables wide range hypotheses be tested pertaining users, their with other content. discovery entropy approximate likes, us unsupervised manner.
منابع مشابه
Learning Classification with Unlabeled Data
One of the advantages of supervised learning is that the final error metric is available during training. For classifiers, the algorithm can directly reduce the number of misclassifications on the training set. Unfortunately, when modeling human learning or constructing classifiers for autonomous robots, supervisory labels are often not available or too expensive. In this paper we show that we ...
متن کاملAutomatic Webpage Classification Enhanced by Unlabeled Data
This paper describes a novel method for webpage classification that uses unlabeled data. The proposed method is based on a sequential learning of the classifiers which are trained on a small number of labeled data and then augmented by a large number of unlabeled data. By taking advantage of unlabeled data, the effective number of labeled data needed is significantly reduced and the classificat...
متن کاملPositive Unlabeled Learning for Data Stream Classification
Learning from positive and unlabeled examples (PU learning) has been investigated in recent years as an alternative learning model for dealing with situations where negative training examples are not available. It has many real world applications, but it has yet to be applied in the data stream environment where it is highly possible that only a small set of positive data and no negative data i...
متن کاملPositive Unlabeled Leaning for Time Series Classification
In many real-world applications of the time series classification problem, not only could the negative training instances be missing, the number of positive instances available for learning may also be rather limited. This has motivated the development of new classification algorithms that can learn from a small set P of labeled seed positive instances augmented with a set U of unlabeled instan...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Scientific Reports
سال: 2021
ISSN: ['2045-2322']
DOI: https://doi.org/10.1038/s41598-021-85608-5